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Abstract 

We determine the limiting empirical singular value distribution for random uni- 
tary matrices with Haar distribution and discrete Fourier transform (DFT) matrices 
when a random set of columns and rows is removed. 

1 Introduction 

An n X n Hermitian matrix A determines a distribution on the real line by 

1 " 



n 

i=l 



where Ai, A„ are the eigenvalues of A. Wigner was the first to determine the limiting 
behavior of such a distribution when the matrix A is random [18J. He initially considered 
symmetric matrices with O's on the diagonal and independent plus or minus I's in the 
upper-triangle and showed that when scaled by the empirical distribution converges 
in probability to the Semicircular Law 



fw{x) 



7r-\/4 — when Ixl < 2 

ZTT II 

otherwise 



Wigner later proved that the same limiting distribution holds for symmetric random vari- 
ables with finite moments [,19J . 

The second classical type of random matrix is the Wishart matrix |EQ1. Let H E 
Qmxn j^^yg independent Gaussian entries with variance ^ . Then HH* is called a Wishart 
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matrix. Marcenko and Pastur showed that the empirical distribution of HH* converges 
in probability to 



fHH*{x) = max(0, 1 - TTZZr^ -^kc+l^ 



where c± = (1 — a/^)^ [JjDJ- Independently, Silverstein and Grenander used a similar 
technique and proved almost sure convergence |l9l. 

This paper applies the approach of Marcenko and Pastur to a question originating in 
geometric functional analysis: we address the singular values of random submatrices of 
random unitary matrices with Haar distribution and of discrete Fourier transform (DFT) 
matrices when a random subset of columns and rows is removed and determine their 
limiting empirical singular value distribution. Let Un denote the n x n random unitary 
matrix with Haar distribution, and f/„ a realization of this random matrix. Let T and Q 
be subsets of {1, n}. We define Uqt to be the matrix obtained from f/„ by removing 
rows with indices not in and columns with indices not in T. The n x n DFT matrix 
has entries 

F, - J_„-27ri(j-l)(fc-l)/n 



F is unitary, and we define Fqt analogously to Uqt- 

We show that when each index is included in ^7 independently with probability (1 — 
and in T independently with probability (1 — p), then the limiting empirical distribution 
of Unri^^T ^^'^ FqtFqt depends only on the parameters p and q and converges almost 
surely to 



f , ^ V '^•^ ^^^^ T ( \ I ™ax(0, 1 - (p + g)) 

fp,qVn = 1^T^ T7i 1 ■ ■^(r--,r+)(2;) + -, ^ <5(x - 1), 

27r(l — — max(|?, gjj 1 — max(p, (7) 



where 
and 



This is formally stated as Theorem 13. 1[ 

The eigenvalue distribution of random unitary matrices with a fixed proportion of 
the bottom rows and right-most columns removed has already been studied. In the case 
when these proportions are equal, i.e. when the resulting matrix is square, the limiting 
empirical eigenvalue density was derived in [22 J, which builds on the work in fTl\. 

The interest in the spectrum of these matrices from the perspective of geometric 
functional analysis is in the largest and smallest eigenvalues of Wn^W^y and Fq^Fut, 
initially perhaps asymptotically, but ideally for finite dimension. These eigenvalues are 
related to discrete uncertainty principles, as well as random projections and embeddings. 
More discussion of their significance is given following Theorem 13.11 The first works 
on the extremal eigenvalues in the Wishart case were [ISIlIlEl, and in the Wigner case it 
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was [2J. It is important to note that the limiting empirical distributions were determined 
in these two cases before the behavior of the extremal eigenvalues was proved and were 
essential in that effort. We hope that the distribution presented here leads to similar 
developments. 

1.1 Notation 

To make notation easiest, a single subscript will denote the dimension of a square matrix, 
while a double index will refer to an entry of the matrix. Thus Fn will denote the n- 
dimensional DFT matrix, and 




-27Tiij-l)(k-l)/n 



will denote its entry at index {j,k). Un will denote the random unitary matrix with Haar 
distribution and of dimension n and Un will denote a realization of this random variable. 

When we want to make the original dimension apparent, we write -Fb„T„- Wqt and 
Un^T„ denote the analogous random variables for the Haar case. We will use V to denote 
an arbitrary unitary matrix. 

We find it helpful to also work with matrices with rows and columns set to zero rather 
than removed. For clarity we make the following definitions. 

Definition 1.1 A square matrix is called a diagonal projection matrix if its off-diagonal 
entries are all zero and its diagonal entries are zero or one. 

Definition 1.2 A random diagonal projection matrix will be called a Bernoulli diagonal 
projection matrix if the diagonal entries are independent and equal to 1 with probability 
1 — p and equal to with probability p. 

The matrices P„ and Qn will denote independent Bernoulli diagonal projection ma- 
trices. Asymptotically, P„ and Qn randomly "erase" the percentage p and q respectively 
of a vector. For a matrix A, A* denotes the conjugate transpose of A. 

Note 1.3 Throughout this paper we take the square root of a complex number to be 
uniquely defined by having argument in [0, tt). The reader will see that this is justified. 

Note 1.4 When either p or q is or 1, the corresponding matrix is trivial. For the 
convergence of several sums in later proofs, we assume that p, q G (0, 1). 



2 The Stieltjes and 77 Transforms 

Our main tool is the Stieltjes transform, which is only defined for real random variables. 
Thus, we will determine the limiting eigenvalue distribution of 

P F O F*P 
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which of course is real and contained in [0, 1]. 

The Stieltjes transform of a real random variable X with distribution function Fx (x) 
is a function mx : C+ — )• M defined by 



mx{z) = Ex 



X 



If Fx is continuous at x, then fx{x) can be recovered by the Stieltjes inversion for- 
mula ffl 

fx{x) = — lim 53 mx{x + iu). (1) 

We will determine the Stieltjes transforms of PnFnQnF*Pn and PnUnQrJ^nPn by first 
using the //-transform, which was introduced by Tulino and Verdu in [|T6l . For a real 
valued random variable X, the //-transform is also a function rjx '■ C+ — )■ M defined by 



r]x{z) = Ex 



1 + zX 

Note that for z in an appropriate region of convergence 



mx(z 



^ oo 

--E( 



fc=0 



and 



so that 



A;=0 



mx{z) 



—Vx[ — • 

z z 



(2) 



(3) 



In this section we determine the //-transform for the matrices PnFnQnF*Pn and 
PrtUnQn^n^n, Proposition |2.6[ Wc require several lemmas en route to this proposition. 

Lemma 2.1 Let P be a mean-zero, random diagonal matrix with independent entries in 
[—1,1] and of dimension n. Then there exists a constant Cm such that for dimension n 
and all 1 < i, j < n, 

E\F*PF\T, < Cmu"^!'^. 



The constant Cm increases with m. 



Proof Since F*PF is Toeplitz we look at (F*PF)o,i = - ELi ^Me^^^^''"^^^''^^^"- 
Set Yk = 3?Pfc,fce2^*(^- and Zk = S^P^./.e^^^^'^-^^C-^)^. The Yk^ k = 1, ...,n, are 
independent random variables, and — 1 < Va,. < 1 for all k. The same also holds for the 
Zk. Since the P^ are mean-zero, E Ylk=i ^fc = ^^'^ ^ Yl'k=i = 0. We have 

n\lj:,iY,+^z,)\>t) < n\\j:Y,\>i.yj\\j^z,\>i. 



k=l 



k=l 



k=l 



V2' 
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k=i ^ k=i ^ 

< 4e * 4 



where the last inequality is Hoeffding's inequality. 
Lastly, 



E|-V(n + zZ,)r = / ms"'--'¥{\-y2iYk + tZk)\>s)ds 

1,-1 u I — 1 



k=l k=l 

2 

< 4:171 s e 4 







/oo / . \ (m-l)/2 

2™-^ (^^J e-\nt)-'/^dt 



oo 





^ 2 ^ 

a 

For the Haar case we use the work of Gamaev and Gluskin f5l iTll. The probability 
given in Theorem 3 in [|71| combined with the main theorem of [5] give the following 
theorem. Here denotes the unit ball in /^(R'"), and Ph denotes the orthogonal 
projection onto the subspace H. 

Theorem 2.2 (Main Theorem in [5]) Let H be an m-dimensional subspace o/M", m < 
n, with Grassmanian distribution. Then there exist absolute constants c and M depend- 
ing only on the ratio ofn to m such that with probability 1 — e~^" 

M 
'n 



Lemma 2.3 Let P be a mean-zero random diagonal matrix with independent entries in 
[— 1, 1] and of dimension n. Then there exists a constant C„i such that for dimension n 
and all 1 < i, j < n, 

Ep^u\[U*PU],,,r<C^n-"'/\ 
The constant Cm increases with m. 

Proof We set Tij = [U*PU]ij. We take c to be the constant given by Theorem [Z2l for 
a random 2-dimensional subspace of a larger dimension mo for some large mo. Since c 
is non-decreasing as 2/n — )• 0, we may use the decay constant for an arbitrary large mo. 
M is also the constant given by Theorem l2.2l for a random 2-dimensional subspace of a 
larger dimension mo. Denote by Ui the i*'^ column of U. 
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= E[|rij|™| ||m,||oo < M and \\uj\\oo < M] ■ P(^| \\ui\\oo < M and ||uj||oo < 

+ E[|r,jr|| \\ui\\^>M or > M] ■P(^| \\ui\\^> M or > m) 

< E[|rij|"|| llwilloo < M and ||m^-||oo < M] + e""". 

Set Yfe = n^Pk^kUlfJJkj and = n'^Pk,kU*kUk,j- The Y^, /c = 1, are indepen- 
dent random variables, and — < < for all k. The same also holds for the 
Zk. Since the are mean-zero, E^^^^ = and E^^^-,^ = 0. Now we may 
repeat the calculation used in the proof of Lemma [2T| keeping in mind the factor M^. 
We incorporate e"*^" into □ 

We again use V to represent either a DFT matrix or a random unitary matrix. We then 
define the matrix Wn = Pnyn\ however, in what follows we will not write the subscript 
n. We denote the i^^ column of Wn denoted Wi, and define the following quantity. 

A = I + zPVQV*P-zQi^iWiW* (4) 
= I + z ^ QjjWjW*. (5) 

Lemma 2.4 In both the DFT and the Haar cases, for \z\ < 1 the random variable 
w*A^^Wi defined in equation (|5l) equals a deterministic constant D{z) independent of 
the dimension n plus a random part that depends on the dimension and that converges 
almost surely to with respect to dimension independent of the index i. That is, in 
dimension n, w*A^^Wi = D{z) + X„ j, where D{z) is independent of n and 

Proof We arbitrarily select an index i and denote iii* . If |z| < 1, then for any realization 
of Pn and Qn every entry of the following sum converges: 

W*A-}W = V*P{I + zPVQV*P - zQ(^i.^i,)W*w*.)-^PV 

oo 

= V*Pj2i-^)^iPyQV*P - zQ(i.,i*)W*w*,fPV. 

k=0 

For \z\ < 1, for any 5 we may choose K such that 

oo 

\[V*P J2 {-z)''{PVQV*P)''PV]^,,,,,) \ < 6 

k=K+l 

for any realization of Q, independent of n. For now we just take K to be a large inte- 
ger. Observe that equation dH) is equivalent to requiring the random matrix Q to have a 
deterministic zero at the entry {i*, i*). Therefore we set Qi^i = Qi^i — [1 — q) iox i ^ i* 
and Qi^^i, = -(1 - q). Note that PV{1 - q)IV*P = (1 - q)P. For a fixed K we now 
consider 

K 



v*pY,{-z)\pyQy*p - zQ^i,^i*^w*w*,fpv 



k=0 
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K 



^{-zfV*P[PVQV*P + (1 - q)PfPV 

k=0 
K k 

^{-zf ^1 .1(1- qyV*P[PVQV*Pf-^P^PV 
k=0 j=o ^-^^ 

K k 

^{-zf 5Z I ) (1 - qyV*P[VQV*Pf-W. 



k=0 j=0 

Now we center the P matrices. Set P = P — (1 — p)I. Then 

V*P[VQV*PfV = V*{P+{l-p)I)[VQV*{P+{l-p)I)fV 



^(1 - p)''+^-\''\V*P'^''VQV*P'^^VQ ■ ■ ■ P'^^V 



for ttO) •••5 ttfc equaling or 1 and |a| equalling the sum of the aj's. In the case that all a/s 
are 0, we recall that Qi*^i* = — (1 — g), and thus the (i*, i*) entry of (|6]) is deterministic in 
that case. Thus, the entry of Q equals a constant independent of n plus a linear 

combination of the entries of matrices of the form 

V*P°ovQV*P'''VQ ■ ■ ■ P'^'V 

with a, 7^ for at least one i. We continue to center each random diagonal matrix in 
this way such that eventually we only have constant terms, independent of n, and terms 
of the form 

^*p{i)^g(i) . . . '^/*pWyQ(fc)y*p(fc+l)y (6) 

for some centered matrices (except for the (i*, 2*)-entry) P''^\ P^'^+i) and Q''^\..Q^''^ 
and some 1 < k < K. Note that the dimension n plays no role in these expansions. 
Thus the term in equation Q has a deterministic part independent of the dimension and 
a random part that is a sum of terms of the form Q. The number of such terms depends 
only on K; call this quantity Ki. 

We set T^*) = V*P^^^V and consider a term of the form 

E| (T^gW ■ ■ ■T('=)g('^)T(^+^))(,.,i,)|^ (7) 

= y: ■ ■ ■ Qt'Q'^l ■ ■ ■ ■ ■ ■ QilW ■ ■ ■ Q'sJ} 

l,m,r,s 

xFT^^) _ _ _^(fe+i)^(i) (fe+i) (1) (fc+i)^(i) 

i* ,li i* ,mi mj.,j* i* ,ri r^^i* i* ,si sic,i* /' 

By Lemma im Ep^^qJT.^^^I™ < C^n-^'^+i)/^ in the DFT case for 1 < i,j < n. 
Lemma [23] delivers the same bound for lEw„, Pn,Q„ iT'jj- 1™ in the Haar case. 

(k + l)rj.{l) rp{k + l)rp{l) rp{k + l) rp{l) ^(fc + l)| 

\ i* ,li Ikt^* i* ,mi mk,i* i* ,ri rf^,i* i* ,si si^,i* I 
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(fc) |4(/c+l)Nl/(4(fe+l)) 



< (ElT.l'i • ■ ■ Elr/'^l l^^'^+i) ElT.l'i l^^'^+i) ■ ■ ■ EIT. 

^ „-(4(fc+l))/2 

We now bound 

Y: eq!;) ■ ■ ■ gjf qS ■ ■ ■ Ql^'M' ■ ■ ■ Qi'M' ■ ■ ■ Q?J- (8) 

l,m,r,s 

The expectations in line ([8]) are all less than or equal 1, so the summability is solely a 
question of how many terms in the sums there are. Since the Q^^^'s are independent and 
centered, the expectation of a product of Qt's is zero if there is not at least the square 
of each term or the non-zero term Q^-l Regardless of whether i* is an index in an 
expectation, the number of possible other indices is at most 2k, and for j = 1, 2k, 
there are nonzero expectations for j terms different from i*. Once j integers out of 
{1, ...,n} are chosen, the number of ways to assign them to Ak positions is independent 
of n. Call this number Cj^^k- The number of ways to choose j different numbers out of n 
is (") < n^- Set C^. = 2 maxi<j<2fc C'j,2fc- Then the term in line © is less than or equal 
to 

2k 

ECjikn^ < 2k max Cj^kn'^'^ 
l<7<2fc 

3=1 

Then 

P( I sum of all terms of the form ([61) | > e) 

< Ki \ one such term | > 

= i^i • P I one such term |^ > 

= Kle-^CK,C'Kn-\ (9) 

Since the terms ^ are summable with respect to n, the Borel-Cantelli lemma implies 
that the sum of the random terms 

E 7^(1)^(1) n^k)rj.{k + l) 

ie{l,...,n}*= 

converges almost surely to for all /c as n — )• oo. 

This argument is independent of which index we denote i*, and so the convergence 
is almost sure to the same constant independent of the placement of the zero. 



8 



By taking the limit as the dimension implicit in line Q tends to infinity, we obtain 
the constant Dk{z). Then {Dk{z)}'^^^ is a Cauchy sequence, and, if we denote its limit 

L'(z), then we have \Dk{z) - D{z)\ < 

To show the almost sure convergence we let e > be given. Choose K large enough 
so that < e/4 and \Dk{z) — D{z)\ < e/4. In the following calculation the matrices 
are of dimension n, as indexed by the sum. Using ^ we have 

oo 

Y,n\[W*A-'W\u-D{z)\>e) 

n=l 

oo oo 

= Y.^{\[V*PY,{-^)\PVQV*PfPV]u-D{z)\>e) 

n=l A;=0 

oo K 



n=l k=0 



+ \Dk{z) - D{z)\ + \[V*P J2 i-z)\PVQV*P)''PVU > e) 

k=K+l 

oo K 

< J2^{\[V*Pj2i-^)'iPyQV*P)'PV],, - Dk{z)\ > e/2) 

n=l k=0 

oo 



n=l 



which is finite. The Borel-Cantelli lemma now gives the almost sure convergence of 
[W*Ar^W]u to D{z) for all i. □ 

The following lemma is essentially due to Tulino, Verdu, Caire and Shamai, and was 
developed in their work on (deterministic) Toeplitz matrices conjugated by a random 
Bernoulli projection matrix, which they call 'erasure matrices' [[T7l . The manipulations 
and the insight concerning the term [iy*y4j~^W^]jj are theirs. However, [iy*y4~^iy]jj has 
a different form in the work presented here. As a consequence, the proof of Lemma [231 
requires the preceding lemmas, which are our own. The proof given here is also self- 
contained. Equation (flOl) certainly holds more broadly than just the case covered in [17l 
and the work presented here. Similar general settings where such equations hold are 
proved in [[TSl . 



Lemma 2.5 The rj-transforms of PnFnQnF*Pn and PJJnQJ^nPn converge almost surely 
to the same function, which we denote rjp^g. This function is a solution to the implicit 
equation 

VpM = r]Q{z- z ^ J , (10) 
where rjq is the asymptotic r]-transform ofQn- 
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Proof We return to the set-up given in equations (HI) and Q. We again at first let V stand 
for an arbitrary unitary matrix. Recall that Wn = PnVn and that Wi is the z*'* column of 
Wn. As defined in © and Q 

Ai = I + zPVQV*P - zQ,^,WiW* 
Ai is invertible for z ^ [—1,0], and one can verify directly that 

Now we multiply both sides of equation (fTTI) by zQi^iWiW* and obtain 
zQi,iW,w*iI + zPVQV*P)-^ 



1 + 



1 + 2;Qi,iW*v4. 'ly, 
Summing over i gives 



zPVQV*P{I + zPVQV*P)-^ = ^zQi^iW,w*{I + zPVQV*P)-^ (12) 



= y njMA^\ (13) 

We use the following observation: let M G C"^" be a positive matrix and Ai, its 
eigenvalues. Then 



1 " 1 

1 A 1 + A, 1 

^ 1 + Ai n ^ 1 + Aj 



1=1 

" l + Ai Iv^ Ai 



= 1 -tr(M(J + M)-^). 

Similarly, using equation (fT3l) , 

tr((/ + 2Pygy*P)-i) = 1 -tr(zPFQF*P(/ + 2P\/QK*P)-^) 
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tr{{I + zPVQV*P)-^) = l-tr{zQV*P{I + zPVQV*P)-^PV* 

A zQi^iWiW*A~'^ 



1 - tn 



i=l 

We note 

tr{V* P{I + zPVQV*P)-^PV) 
= tr{P{I + zPVQV*P)-^P) 



1 + zQi,iW*A-'w^ 
}_ zQi^iW*A~^Wi 

1 1 

iV" t 

1 + zQuw*A-h 



1 



1 

tr((/ + zPVQv*p)-') --Y1 = + zpyQy*p)~%,i 

i=l 
1 " 

tr((/ + zPVQV*P)-') ^i^^'^ = 0}' (14) 



where the last line follows from using that [(/ + zPVQV*Py^]i^i = 1 when Pj j = 0. 

Lemma [241 states that when Vn = Fn or when it has Haar distribution w*A^^Wi 
converges almost surely to a number D{z) independent of i as n — )• oo. This constant 
is the same for both the Fourier and the Haar case. Writing E for both IEp„,Q„ and 

7]p^q{z) = \imKtr{{I + zPVQV*P)-^) 

= limE-> — . 

„^oo n ^ 1 + zQi^iW*Ai 'wi 

Let Cz be the decay constant given in Lemma For each i we write w*A^^Wi = 
D{z) + Xn,i, where D{z) is the deterministic part independent of dimension, as in 
Lemma [231 

1 " 1 

n ^ 1 + zQi^i{D{z} + Xn,i) 

< F(-> > 

- ^^n^^l + zQi,iiD{z)+X^,i) nj^^l + zQi^iD{zy 



< 



'^^nj^^l + zQi^i{D{z)+Xn,i) nj^^l + zQi^iD{zy 2' 
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+nii|:^^^^-.«pw.)i>^) (15) 



i=l 

n 



^' ^'^ +zQi^iD{z)){l + zQ,,,{D{z)+Xn,i)y 



-Ynr- 

' 1 

i=i ^ 



< 2''C,CdCrn-\ (18) 



where we use the exponential Bernstein bound for the term in line (|T5l) and incorpo- 
rate it into the 2 in line (fT6l ). For the last inequality (fTSl ) we use to take care of the 
denominator, which is possible for 2 in a small enough circle around the origin. As in 
Lemma we truncate the infinite sum implicit in each X„ j at some index K and col- 
lect the remainder in the term Cj.- The remaining terms in the numerator are all products 
of terms of the form (|7]) with powers summing to 4, and so the work of Lemma [24] ap- 
plies. We use Lemma for the final inequality. Using the Borel-Cantelli lemma, we 
now have the almost sure convergence 



1 

1=1 



+ zQiiW*A^ ^Wi 



^ t1q{zD{z)). 



Using this and Lemma [241 again. 



D{z)n iz) = UmE-V 



i=l 

1„ 



= \mv^-y" w*{I + zWQW*} ^Wi 

i=l 

= Um Etr(iy*(/ + zWQW*)-^W) 

n— >oo 

= Vp,q{^)-P, (19) 

where equation ([19] ) follows from taking the limit with respect to n of the expectation of 
equation ([141) . □ 



Proposition 2.6 Let Pn and Qn be independent Bernoulli as defined above, with ex- 
pected traces 1 — p E (0, 1) and 1 — q E (0, 1) respectively. Then the rj-transforms of 
PnFnQnF*Pn and PnUnQrM^Pn converge almost surely to the asymptotic rj-transform 

. . 1 + (p + q)z + a/I + (2(p + q) - 4:pq)z + ((p + qV - Apq)z'^ 

= WTV) • 

Proof The proof is the same for both types of matrices, so we write it only for the 
DFT case. The matrices FnQnF* have only eigenvalues 1 and 0, and their limiting 
r^-transform is 

riFQF*[z) = riQ{z) = . 
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Applying equation (flOl) from Lemma [231 yields 



1 + q{z — z 



p 



which leads to the equation 



(1 + z)vlq{z) - (1 + (p + q)z)ripAz) + pqz = 0. 
This equation has the solutions 

l + {p + q)z± v/(l + (p + q)zy - 4(1 + z)p^ 
2{l + z) ■ 

Noting that 77p,g(0) must equal 1, we choose the solution with addition. We thus have 

_ l + {p + q)z+ ^1 + {2{p + g) - Apq)z + {p - qfz^ 
VpA^)- 2(1 + z) ■ ^ ^ 

□ 



3 Limiting Empirical Distributions 

Theorem 3.1 For i = 1, ...,n let i be contained in independently with probability 
(1 — q) and, also independently, let i be included in Tn with probability (1 — p). Then 
the empirical distributions of the min(|T„|, \Qn\) ^'^f'S^^t eigenvalues of F^^XnF^„T„ 
Wa^Tn^nnTn converge almost surely to 



f ( \ V ^ ^ r / N , max(0, 1 - (p + g)) 

= 2vr(l-x)(l-max(p,g)) " ^(^'^^^^^^ + l-max(p,g) " " ^'^^ 

where 

r_ = (Vp(i-g)- Vg(i-j>))' 

r+ = (v/p(l-g) + Vg(l-p))'. 

Note that r_ = only when p = q\ that is, when F^^T„F^„Tr, or Wf7„r„Wj^„T„ is asymp- 
totically square. Therefore, when p ^ q the support of the limiting distribution begins at 
r_ > 0. Also, r_|_ = 1 only when p + q = 1, so when p + q ^ 1 there is a gap in the 
support of the limiting distribution from r+ to 1 . When r_ 7^ and r+ 7^ 1 the contin- 
uous part of the measure begins at r_ and makes an arc ending at r+. When r_ = or 
r+ = 1, the continuous part of (|2TI) tends to 00 as a; — > or x — > 1. When p + q < 1 
there is a point mass of measure 1 — (p + g) at 1, and when p + q > 1, the support stops 
at r+ and there is no point mass at 1. In Figure 1 the continuous part of the asymptotic 
distribution is plotted against empirical values for several parameter pairs p and q. 
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Figure 1 : Empirical eigenvalue distribution plotted against continuous part of asymptotic 
distribution. In plots (a) and (b) the matrices are submatrices of a random unitary matrix 
with Haar distribution. In plots (c) and (d) the matrices are submatrices of the DFT 
matrix. In each case, the original matrix dimension was 100/(1 — max(p, q)), so that the 
submatrices all had expected dimensions 100 x 100. Each random matrix was realized 
100 times. 



We discuss the relationship between Theorem 13.11 and uncertainty principles and 
other areas before turning to the proof. We focus first on the DFT case and assume 
that the dimension is n. The norm of a DFT matrix with a set of rows and columns 
removed equals 1 if and only if there exists a vector with time support corresponding 
to the remaining columns and frequency support corresponding to the remaining rows. 
That is, denoting Fx = x, in the notation of this paper, II-FctII = 1 if and only if there 
exists X G C" such that supp(s) C T and supp(a;) C ^l. One is then interested in 
determining conditions on the cardinality of and |T| such that H-FhtII < 1j where 
generally at least one set is random and a statement is made in some probabilistic form. 
This non-asymptotic question has been studied intensively over the last ten years. Recent 
results and a general discussion can be found in lfT4l . While Theorem 13. II does not assert 
the non-existence of any vectors, it does state when certain vectors do exist and sheds 
light on one of the main theorems in this area, namely that of Tao. 
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Theorem 3.2 (Tao, Ifl3ll ) Ifn is prime and \ Vt\ + \T\ < n, then \\Fq_t\\ < 1- 

ini ITI 

Note that since Tao's theorem requires n to be prime, it precludes the case that „ + „ = 
1. Theorem 13. II says that if ^ + ^ 1, then 11 -^n„T„|| converges almost surely to 
1, and in fact this also holds for random unitary matrices Un^Tn^^n^Tn- Almost sure 
convergence and the existence of infinitely many primes imply that for all ei, £2 > 0, 
there exists a prime n and sets i7„, T„ C {1, ...,n} such that + '-^ < 1 + ci and 
||-^c„T„|| > 1 — £2- In fact, the proportion of subsets for which ||-Fb„T„|| < 1 — £2 
converges to as n — )• 00. Moreover, as n increases, the Haar measure of the set of 

I n I IT I 

unitary matrices U such that ||f/n„T„|| > 1 — £2 for random sets satisfying '-^ + '-^ < 
1 + ci also converges to 1. 

A further area of interest is the smallest eigenvalue of F^,,T,,F^^rp^ or W^^T^Wj^^^^; 
in particular, one would like to bound the smallest eigenvalue away from 0. See [fT2ll for 
recent results in the setting of independent matrix entries. While Theorem 13. ll does not 
make any statement on when the smallest eigenvalue is strictly positive, it does say that 
if ^ 1' then the smallest eigenvalue of F^^t,,F^„^^ and Kn„T„U^„T„ converge to 0. 
This corresponds to the behavior of square matrices with independent entries, though in 
that case non- asymptotic bounds away from exist [[T2l . 

Theorem 13.11 and some numerical experiments suggest the obvious conjecture that 
the largest and smallest eigenvalues converge to the edge of the limiting support. This 
would imply that Tao's result gives the general uncertainty principle behavior for DFT 
and random unitary matrices, and that the submatrices that do not have this behavior have 
measure zero asymptotically. As was the case for the Wigner and Wishart distributions, 
we hope that the limiting empirical distribution is helpful in determining the behavior of 
the extremal eigenvalues. 

We note, lastly, that a potential further step in this direction is restricted isometry 
properties. Here one set, say Tn, is taken at random and one seeks to bound the extremal 
eigenvalues of Fo^T„-Pa„T„ or ^n^T^W^^^^ when Q,n ranges over all subsets of certain 
cardinality. This property of random matrices is central to compressed sensing and has 
received enormous attention in recent years; see yl for a recent overview. 
Proof [Proof of Theorem 13. 11 1 We give the proof using the notation for the DFT case, 
but the two are identical. We begin by working with PnFnQnF*Pn, where P„ and Qn 
are a sequence of independent Bernoulli diagonal projection matrices, as defined earlier, 
with expected traces 1 — p E (0, 1) and 1 — g G (0, 1) respectively. By Proposition 12.61 
we have 

1 " 1 

for all 2; G C+. By applying this to — i, multiplying both terms in (fllj) by — ^ and using 
equation ([3]), we obtain 

1 1 

I- V - mp^^{z)\ ^ 0. 



15 



Thus, the random measures induced by the eigenvalues at each dimension n converge 
almost surely to the probability measure corresponding to nip g [1] (Theorem 2.4.4(c) 
in [[U also holds for almost sure convergence). By equation Q the limiting Stieltjes 
transform is 



_l 1 - (p + g)i + Jl + (2(p + q)- 4m)(^) + {p - g)2(l)2 
= ^7^3^ 



l-{p + q)l+ Jl- {2{p + q)- Apq)\ + (p - qf^ 



2(1 -z) 

and we are interested in the inverse Stieltjes transform 

lim — 55 m„ „(x + icj). 

^p,q{x + iio) is a continuous function of both x and a; for a; G (0, 1). Thus, using 
equation ([T]) for x G (0, 1), 



^ ^l - {p + q)l + ^l - {2{p + q) - Apq)l + {p- qf^ 
fpA^) = 2(1 -x) 



1- 2 p + g -4pg i+ p-g 2J_ 

$5 ^ ^ -. (23) 

2n{l-x) 

Imitating Marcenko and Pastur [fTOl, we denote the roots of the equation x^ — (2(p + 
q) — Apq)x + (p — g)^ = by r_ and r+. These values are 

_ (2(p + q)- Apq) - a/(2(p + g) - Apqf - 4(p -"gp 

~ 2 

_ (2(p + g) - 4pg) - 2a/ (p + g - 2pg - {p - q)){j) + q - 2pq + {p - g)) 

2 

{2{p + g) - 4pg) - 2 v/(2g(l - p))(2p(l - g)) 

2 

= p(l - g) + g(l - p) - 2^/p{l - g)g(l - p) 

= (Vp(i-g)- Vg(i-P))' 



and r.,. = {^/p{l — q) + A/g(l — p)Y, as defined in the statement of the theorem. For 
X G (0, 1) equation (123]) is now 



^ ,,'1 - (2(p + g) - 4pg)i + (p - g)2 J, 



27r('l -x) 



^^^^^TTT^^^^ (24) 
27r(l — x) 
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27r 1 - X 



We now determine the density at a; = 0, for which we need to find the asymptotic 
proportion of zero eigenvalues of P„i^„Q„F^P„ or PnUnQrMn^n- This proportion is 
given by limr-^oo '7p,q('^), since the latter quantity gives the measure of the set with 
respect to the measure fp^q. We have 



I + (p + q)r + ^y 1 + (2(p + q) - 4pq)r + (p - q^r'^ 
nm ri„n{r) = lim 

r— s>oo r— s>oo 2(1 + r) 

{p + q) + \p-q\ 
2 

= max(p, g). (25) 

Lastly, we investigate the point x = 1. We denote by /i(l) the measure of the set 
{1} with respect to the limiting distribution. Let X denote the random variable of the 
eigenvalues. We first must address the convergence of SfcLo(~^)'^^"^^- If P + 7^ 1 
and p q, then < r_ and r+ < 1, and 



/(I _ _ I) rr+ 







V^LAE^!^/, , II . ^fc+l + n(l) 



If p + g 7^ 1 and p = q, which implies r_ = 0, then we pick a small e and have 



¥.X^ < I e^^- jlJIjl. Ldx 

27r(l-x) 



+11 2.(1-:) ' ^i..-.iii~7o '^^^""^'^ 



When p + q = 1 and when z belongs to a region of convergence to be determined 
shortly, equation ^ implies 

oo 

Since we have assumed that neither p or q equals 1 or 0, we have r_ < 1. We show 
that the sum on the left side of equation (l26l) converges for \z\ < l/r_. We define 
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k k even 
k — 1 k odd 



El 

A:=0 



fc=0 
2 



k=0 
2 

E 



fe=0 
2 

s E 

fc=0 
2 

s E 



fc=0 
2 

E 



27r(l-a;) 

oo „i 

fc=3 
oo 



'i - 1 

„fc V ^ 

27r(l -x^ 



fc=3 



27r(l -x) 

, oo „i 

EX^' + — yi^lM x'=-i/=^(l-x)-i/2rfx 
f-^ Jr, 



k=3 



\7\ °^ 
fe=2 



(27) 



fc=2 

oo 



.^0 2(LfcJ.-j) + 3 
The integral in line (1271 ) is given by equation 2.221 in [8]. One may verify that 



2(n-A;) + 3 2(n - A; + 1) + 3 \k!^ ^ 



n 



n 
k-1 



(1 - r+) 



n-fc+l 



SO that 



We 

J 



(l-r_) 



Thus 



fc=0 



A;=2 



which converges for all \z\ < l/r_. This gives 

oo 

fi{l) = {l + z)r^,,,{z)-{l + z)J2^' 



k=0 



27r(l-x) 



c/x, (28) 



where equation (|28l) holds for all values \z\ < 1, and the sum on the right side of equa- 
tion (|28] ) remains finite as z — )■ —1. Convergence of the necessary sums and integrals is 
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now established for all J9, g G (0, 1). Using the equation for i]p,q{z) from Proposition [2]6l 



1 + {p + q)z+ ^1 + (2(p + g) - 4pq)z + ((p + - Apq): 

\L + Z)rip^q[Z) — 2 

Allowing 2; —7- —1, we have 



^(1) = ^ 

When p + g > 1, equation (|29l ) is equal to 0, and when p + g < 0, equation (|29l ) is equal 
to 1 — min(p, g). From equations (l25l) and (|29l ). it follows that when p + g > 1 



_i) 

/ ^wf ^ = 1 ~ max(p, g), 

Jr_ 27r(l - x) 

and when p + g < 1, 



27r(l - x) 



'■+,,,. 

cix = min(p,g). 

/7r(l — x) 

Now it only remains to remove the point mass at and normalize the distribution by 

1/(1 — max(p, g)). □ 

For the following corollary we define the n singular values of the matrix PnFnQn to 
be the (positive) square roots of the n eigenvalues of the matrix PnFnQnF*Pn- We thus 
have the following limiting distribution for the singular values of PnFnQn- 

Corollary 3.3 For i = 1, ...,n let i be contained in i7„ independently with probability 
(1 — g) and, also independently, let i be included in Tn with probability (1 — p). Then 
the empirical distributions of the min(|T„|, |f2„|) largest singular values of F^^t^ '^^d 
Wf7„T„ converge almost surely to 



fs ( ^_ V ^'(1 - - 1) , , , , max(0,l-(p + g)) _ 

- 2vr(l-x2)(l-max(p,(7)) (^-'^+)^^^+ 1 - max(p, ' ^^"^ ' 

where 

r'_ = IVMI -g) - \/g(l 

and 

rl = \/p(i - q) + \/g(i -p). 

Proof The measures for the singular values and 1 are the same as the eigenvalues 
and 1 of the previous theorem. Also, the continuous part of the measure will clearly have 
support [rf_, r^] = [y/rl, y/^] - Using equation (l24l) . we have 
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/(1_ 

" ^ 27r(l-a;2) 



Trfl — 



a 
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